Using Intel Streaming SIMD Extensions for 3D Geometry Processing
نویسندگان
چکیده
Three dimensional (3D) graphics applications is an important workload running on today’s computer system. A cost-effective graphics solution is to use a general processor for 3D geometry processing and a specialized hardware for rasterization. 3D geometry processing is an inherently parallel task. Therefore, many CPU vendors add SIMD (Single Instruction Multiple Data) instruction extensions to accelerate 3D geometry processing. In this paper, we evaluate the performance impact of using the Intel Streaming SIMD Extensions (SSE) for 3D geometry processing. We use SIMD-FP to improve the computational throughput by processing four vertices in parallel. We find that the layout of vertices in memory is important for the effectiveness of SIMD-FP. We also study the effect of using prefetch instructions to improve the memory performance. The experimental results show that using Intel SSE can achieve close to 4x speedup for geometry processing.
منابع مشابه
Optimized CPU-based Skinning for 3D Games
Introduction Lifelike 3D character models play an increasingly important role in many computer games. Organic models, such as people, are more complex to render than rigid bodies because the mesh that defines the shape of the model constantly changes as the model animates. This animating mesh is referred to as a ‘skin’ since it’s influenced by the underlying structure of the object; ‘skinning’ ...
متن کاملReal-time full-programmable fuzzy processor on an Intel Pentium III
This paper presents a real-time fullprogrammable fuzzy processor using piecewiselinear intelpolation techniques and implements it using the SSE (Streaming SIMD Extensions) set of instructions of an Intel Pentium III. A fullprogrammable fuzzy processor is defined as a system where the set of rules, the membership finctions, the t-norm, the t-conorm, the aggregation operator, the propagation oper...
متن کاملAn Implementation of Parallel 1-D FFT Using SSE3 Instructions on Dual-Core Processors
In the present paper, an implementation of a parallel one-dimensional fast Fourier transform (FFT) using Streaming SIMD Extensions 3 (SSE3) instructions on dual-core processors is proposed. Combination of vectorization and the block six-step FFT algorithm is shown to effectively improve performance. The performance results for one-dimensional FFTs on dual-core Intel Xeon processors are reported...
متن کاملThe Sse Provides a Rich Set of Instructions to Meet the Requirements of Demanding
0272-1732/00/$10.00 2000 IEEE In the volume PC market, the demand is growing for general-purpose processors that facilitate visual and graphical computing on the Internet. Responding to this need, Intel developed the streaming SIMD extensions (SSE), a set of processor instructions designed to boost performance of multimedia and Internet applications, and implemented them on its Pentium III pr...
متن کاملFast Trigonometric Functions Using Intel’s Sse2 Instructions
The goal of this work was to answer one simple question: given that the trigonometric functions take hundreds of clock cycles to execute on a Pentium IV, can they be computed faster, especially given that all Intel processors now have fast floating-point hardware? The streaming SIMD extensions (SSE/SSE2) in every Pentium III and IV provide both scalar and vector modes of computation, so it has ...
متن کامل